Overview

Dataset statistics

Number of variables34
Number of observations148670
Missing cells181135
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory197.7 MiB
Average record size in memory1.4 KiB

Variable types

Numeric11
Categorical23

Alerts

year has constant value "2019"Constant
Gender is highly overall correlated with co-applicant_credit_typeHigh correlation
Interest_rate_spread is highly overall correlated with Secured_by and 4 other fieldsHigh correlation
Secured_by is highly overall correlated with Interest_rate_spread and 4 other fieldsHigh correlation
Security_Type is highly overall correlated with Interest_rate_spread and 4 other fieldsHigh correlation
Status is highly overall correlated with Interest_rate_spread and 1 other fieldsHigh correlation
Upfront_charges is highly overall correlated with Secured_by and 3 other fieldsHigh correlation
business_or_commercial is highly overall correlated with loan_typeHigh correlation
co-applicant_credit_type is highly overall correlated with GenderHigh correlation
construction_type is highly overall correlated with Interest_rate_spread and 4 other fieldsHigh correlation
credit_type is highly overall correlated with StatusHigh correlation
income is highly overall correlated with loan_amount and 1 other fieldsHigh correlation
loan_amount is highly overall correlated with income and 1 other fieldsHigh correlation
loan_type is highly overall correlated with business_or_commercialHigh correlation
open_credit is highly overall correlated with Upfront_chargesHigh correlation
property_value is highly overall correlated with income and 1 other fieldsHigh correlation
rate_of_interest is highly overall correlated with Interest_rate_spread and 3 other fieldsHigh correlation
loan_limit is highly imbalanced (63.9%)Imbalance
Credit_Worthiness is highly imbalanced (74.6%)Imbalance
open_credit is highly imbalanced (96.4%)Imbalance
Neg_ammortization is highly imbalanced (52.5%)Imbalance
interest_only is highly imbalanced (72.3%)Imbalance
lump_sum_payment is highly imbalanced (84.3%)Imbalance
construction_type is highly imbalanced (99.7%)Imbalance
occupancy_type is highly imbalanced (72.9%)Imbalance
Secured_by is highly imbalanced (99.7%)Imbalance
total_units is highly imbalanced (93.6%)Imbalance
Security_Type is highly imbalanced (99.7%)Imbalance
loan_limit has 3344 (2.2%) missing valuesMissing
rate_of_interest has 36439 (24.5%) missing valuesMissing
Interest_rate_spread has 36639 (24.6%) missing valuesMissing
Upfront_charges has 39642 (26.7%) missing valuesMissing
property_value has 15098 (10.2%) missing valuesMissing
income has 9150 (6.2%) missing valuesMissing
LTV has 15098 (10.2%) missing valuesMissing
dtir1 has 24121 (16.2%) missing valuesMissing
LTV is highly skewed (γ1 = 120.6153375)Skewed
ID is uniformly distributedUniform
ID has unique valuesUnique
Upfront_charges has 20770 (14.0%) zerosZeros

Reproduction

Analysis started2025-12-28 17:58:28.158826
Analysis finished2025-12-28 17:59:12.467743
Duration44.31 seconds
Software versionydata-profiling vv4.18.0
Download configurationconfig.json

Variables

ID
Real number (ℝ)

Uniform  Unique 

Distinct148670
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean99224.5
Minimum24890
Maximum173559
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:12.587321image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum24890
5-th percentile32323.45
Q162057.25
median99224.5
Q3136391.75
95-th percentile166125.55
Maximum173559
Range148669
Interquartile range (IQR)74334.5

Descriptive statistics

Standard deviation42917.477
Coefficient of variation (CV)0.43252903
Kurtosis-1.2
Mean99224.5
Median Absolute Deviation (MAD)37167.5
Skewness0
Sum1.4751706 × 1010
Variance1.8419098 × 109
MonotonicityStrictly increasing
2025-12-28T18:59:12.725734image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
248901
 
< 0.1%
1239791
 
< 0.1%
1239991
 
< 0.1%
1240001
 
< 0.1%
1240011
 
< 0.1%
1240021
 
< 0.1%
1240031
 
< 0.1%
1240041
 
< 0.1%
1240051
 
< 0.1%
1240061
 
< 0.1%
Other values (148660)148660
> 99.9%
ValueCountFrequency (%)
248901
< 0.1%
248911
< 0.1%
248921
< 0.1%
248931
< 0.1%
248941
< 0.1%
248951
< 0.1%
248961
< 0.1%
248971
< 0.1%
248981
< 0.1%
248991
< 0.1%
ValueCountFrequency (%)
1735591
< 0.1%
1735581
< 0.1%
1735571
< 0.1%
1735561
< 0.1%
1735551
< 0.1%
1735541
< 0.1%
1735531
< 0.1%
1735521
< 0.1%
1735511
< 0.1%
1735501
< 0.1%

year
Categorical

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.6 MiB
2019
148670 

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters594680
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019148670
100.0%

Length

2025-12-28T18:59:12.870365image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:12.948812image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
2019148670
100.0%

Most occurring characters

ValueCountFrequency (%)
2148670
25.0%
0148670
25.0%
1148670
25.0%
9148670
25.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)594680
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2148670
25.0%
0148670
25.0%
1148670
25.0%
9148670
25.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)594680
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2148670
25.0%
0148670
25.0%
1148670
25.0%
9148670
25.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)594680
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2148670
25.0%
0148670
25.0%
1148670
25.0%
9148670
25.0%

loan_limit
Categorical

Imbalance  Missing 

Distinct2
Distinct (%)< 0.1%
Missing3344
Missing (%)2.2%
Memory size8.4 MiB
cf
135348 
ncf
 
9978

Length

Max length3
Median length2
Mean length2.0686594
Min length2

Characters and Unicode

Total characters300630
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowcf
2nd rowcf
3rd rowcf
4th rowcf
5th rowcf

Common Values

ValueCountFrequency (%)
cf135348
91.0%
ncf9978
 
6.7%
(Missing)3344
 
2.2%

Length

2025-12-28T18:59:13.032037image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:13.100452image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
cf135348
93.1%
ncf9978
 
6.9%

Most occurring characters

ValueCountFrequency (%)
c145326
48.3%
f145326
48.3%
n9978
 
3.3%

Most occurring categories

ValueCountFrequency (%)
(unknown)300630
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
c145326
48.3%
f145326
48.3%
n9978
 
3.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown)300630
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
c145326
48.3%
f145326
48.3%
n9978
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown)300630
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
c145326
48.3%
f145326
48.3%
n9978
 
3.3%

Gender
Categorical

High correlation 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size9.2 MiB
Male
42346 
Joint
41399 
Sex Not Available
37659 
Female
27266 

Length

Max length17
Median length6
Mean length7.9382391
Min length4

Characters and Unicode

Total characters1180178
Distinct characters18
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSex Not Available
2nd rowMale
3rd rowMale
4th rowMale
5th rowJoint

Common Values

ValueCountFrequency (%)
Male42346
28.5%
Joint41399
27.8%
Sex Not Available37659
25.3%
Female27266
18.3%

Length

2025-12-28T18:59:13.181368image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:13.262343image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
male42346
18.9%
joint41399
18.5%
sex37659
16.8%
not37659
16.8%
available37659
16.8%
female27266
12.2%

Most occurring characters

ValueCountFrequency (%)
e172196
14.6%
l144930
12.3%
a144930
12.3%
o79058
 
6.7%
i79058
 
6.7%
t79058
 
6.7%
75318
 
6.4%
M42346
 
3.6%
J41399
 
3.5%
n41399
 
3.5%
Other values (8)280486
23.8%

Most occurring categories

ValueCountFrequency (%)
(unknown)1180178
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e172196
14.6%
l144930
12.3%
a144930
12.3%
o79058
 
6.7%
i79058
 
6.7%
t79058
 
6.7%
75318
 
6.4%
M42346
 
3.6%
J41399
 
3.5%
n41399
 
3.5%
Other values (8)280486
23.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown)1180178
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e172196
14.6%
l144930
12.3%
a144930
12.3%
o79058
 
6.7%
i79058
 
6.7%
t79058
 
6.7%
75318
 
6.4%
M42346
 
3.6%
J41399
 
3.5%
n41399
 
3.5%
Other values (8)280486
23.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown)1180178
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e172196
14.6%
l144930
12.3%
a144930
12.3%
o79058
 
6.7%
i79058
 
6.7%
t79058
 
6.7%
75318
 
6.4%
M42346
 
3.6%
J41399
 
3.5%
n41399
 
3.5%
Other values (8)280486
23.8%

approv_in_adv
Categorical

Distinct2
Distinct (%)< 0.1%
Missing908
Missing (%)0.6%
Memory size8.7 MiB
nopre
124621 
pre
23141 

Length

Max length5
Median length5
Mean length4.6867801
Min length3

Characters and Unicode

Total characters692528
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownopre
2nd rownopre
3rd rowpre
4th rownopre
5th rowpre

Common Values

ValueCountFrequency (%)
nopre124621
83.8%
pre23141
 
15.6%
(Missing)908
 
0.6%

Length

2025-12-28T18:59:13.382648image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:13.465529image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
nopre124621
84.3%
pre23141
 
15.7%

Most occurring characters

ValueCountFrequency (%)
p147762
21.3%
r147762
21.3%
e147762
21.3%
n124621
18.0%
o124621
18.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)692528
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
p147762
21.3%
r147762
21.3%
e147762
21.3%
n124621
18.0%
o124621
18.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)692528
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
p147762
21.3%
r147762
21.3%
e147762
21.3%
n124621
18.0%
o124621
18.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)692528
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
p147762
21.3%
r147762
21.3%
e147762
21.3%
n124621
18.0%
o124621
18.0%

loan_type
Categorical

High correlation 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.8 MiB
type1
113173 
type2
20762 
type3
14735 

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters743350
Distinct characters7
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowtype1
2nd rowtype2
3rd rowtype1
4th rowtype1
5th rowtype1

Common Values

ValueCountFrequency (%)
type1113173
76.1%
type220762
 
14.0%
type314735
 
9.9%

Length

2025-12-28T18:59:13.745525image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:13.818528image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
type1113173
76.1%
type220762
 
14.0%
type314735
 
9.9%

Most occurring characters

ValueCountFrequency (%)
t148670
20.0%
y148670
20.0%
p148670
20.0%
e148670
20.0%
1113173
15.2%
220762
 
2.8%
314735
 
2.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)743350
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t148670
20.0%
y148670
20.0%
p148670
20.0%
e148670
20.0%
1113173
15.2%
220762
 
2.8%
314735
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)743350
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t148670
20.0%
y148670
20.0%
p148670
20.0%
e148670
20.0%
1113173
15.2%
220762
 
2.8%
314735
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)743350
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t148670
20.0%
y148670
20.0%
p148670
20.0%
e148670
20.0%
1113173
15.2%
220762
 
2.8%
314735
 
2.0%

loan_purpose
Categorical

Distinct4
Distinct (%)< 0.1%
Missing134
Missing (%)0.1%
Memory size8.4 MiB
p3
55934 
p4
54799 
p1
34529 
p2
 
3274

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters297072
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowp1
2nd rowp1
3rd rowp1
4th rowp4
5th rowp1

Common Values

ValueCountFrequency (%)
p355934
37.6%
p454799
36.9%
p134529
23.2%
p23274
 
2.2%
(Missing)134
 
0.1%

Length

2025-12-28T18:59:13.927526image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:14.017195image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
p355934
37.7%
p454799
36.9%
p134529
23.2%
p23274
 
2.2%

Most occurring characters

ValueCountFrequency (%)
p148536
50.0%
355934
 
18.8%
454799
 
18.4%
134529
 
11.6%
23274
 
1.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)297072
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
p148536
50.0%
355934
 
18.8%
454799
 
18.4%
134529
 
11.6%
23274
 
1.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)297072
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
p148536
50.0%
355934
 
18.8%
454799
 
18.4%
134529
 
11.6%
23274
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)297072
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
p148536
50.0%
355934
 
18.8%
454799
 
18.4%
134529
 
11.6%
23274
 
1.1%

Credit_Worthiness
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.4 MiB
l1
142344 
l2
 
6326

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters297340
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowl1
2nd rowl1
3rd rowl1
4th rowl1
5th rowl1

Common Values

ValueCountFrequency (%)
l1142344
95.7%
l26326
 
4.3%

Length

2025-12-28T18:59:14.127175image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:14.207088image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
l1142344
95.7%
l26326
 
4.3%

Most occurring characters

ValueCountFrequency (%)
l148670
50.0%
1142344
47.9%
26326
 
2.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l148670
50.0%
1142344
47.9%
26326
 
2.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l148670
50.0%
1142344
47.9%
26326
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l148670
50.0%
1142344
47.9%
26326
 
2.1%

open_credit
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.6 MiB
nopc
148114 
opc
 
556

Length

Max length4
Median length4
Mean length3.9962602
Min length3

Characters and Unicode

Total characters594124
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownopc
2nd rownopc
3rd rownopc
4th rownopc
5th rownopc

Common Values

ValueCountFrequency (%)
nopc148114
99.6%
opc556
 
0.4%

Length

2025-12-28T18:59:14.288775image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:14.359132image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
nopc148114
99.6%
opc556
 
0.4%

Most occurring characters

ValueCountFrequency (%)
o148670
25.0%
p148670
25.0%
c148670
25.0%
n148114
24.9%

Most occurring categories

ValueCountFrequency (%)
(unknown)594124
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o148670
25.0%
p148670
25.0%
c148670
25.0%
n148114
24.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown)594124
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o148670
25.0%
p148670
25.0%
c148670
25.0%
n148114
24.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown)594124
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o148670
25.0%
p148670
25.0%
c148670
25.0%
n148114
24.9%

business_or_commercial
Categorical

High correlation 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.8 MiB
nob/c
127908 
b/c
20762 

Length

Max length5
Median length5
Mean length4.7206968
Min length3

Characters and Unicode

Total characters701826
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownob/c
2nd rowb/c
3rd rownob/c
4th rownob/c
5th rownob/c

Common Values

ValueCountFrequency (%)
nob/c127908
86.0%
b/c20762
 
14.0%

Length

2025-12-28T18:59:14.447411image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:14.553448image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
nob/c127908
86.0%
b/c20762
 
14.0%

Most occurring characters

ValueCountFrequency (%)
b148670
21.2%
/148670
21.2%
c148670
21.2%
n127908
18.2%
o127908
18.2%

Most occurring categories

ValueCountFrequency (%)
(unknown)701826
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
b148670
21.2%
/148670
21.2%
c148670
21.2%
n127908
18.2%
o127908
18.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown)701826
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
b148670
21.2%
/148670
21.2%
c148670
21.2%
n127908
18.2%
o127908
18.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown)701826
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
b148670
21.2%
/148670
21.2%
c148670
21.2%
n127908
18.2%
o127908
18.2%

loan_amount
Real number (ℝ)

High correlation 

Distinct211
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean331117.74
Minimum16500
Maximum3576500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:14.654565image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum16500
5-th percentile106500
Q1196500
median296500
Q3436500
95-th percentile656500
Maximum3576500
Range3560000
Interquartile range (IQR)240000

Descriptive statistics

Standard deviation183909.31
Coefficient of variation (CV)0.55541968
Kurtosis9.1277753
Mean331117.74
Median Absolute Deviation (MAD)120000
Skewness1.6669981
Sum4.9227275 × 1010
Variance3.3822634 × 1010
MonotonicityNot monotonic
2025-12-28T18:59:14.796428image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2065004610
 
3.1%
2565004079
 
2.7%
1565003967
 
2.7%
2265003944
 
2.7%
4865003819
 
2.6%
3065003691
 
2.5%
2465003669
 
2.5%
2165003649
 
2.5%
2365003553
 
2.4%
2665003543
 
2.4%
Other values (201)110146
74.1%
ValueCountFrequency (%)
165003
 
< 0.1%
2650027
 
< 0.1%
36500119
 
0.1%
46500212
 
0.1%
56500810
 
0.5%
66500859
 
0.6%
765001701
1.1%
865001605
1.1%
965001484
1.0%
1065003210
2.2%
ValueCountFrequency (%)
35765001
 
< 0.1%
33465001
 
< 0.1%
30065004
< 0.1%
29865001
 
< 0.1%
29265001
 
< 0.1%
27065001
 
< 0.1%
26265001
 
< 0.1%
26065001
 
< 0.1%
25965001
 
< 0.1%
25065002
< 0.1%

rate_of_interest
Real number (ℝ)

High correlation  Missing 

Distinct131
Distinct (%)0.1%
Missing36439
Missing (%)24.5%
Infinite0
Infinite (%)0.0%
Mean4.0454758
Minimum0
Maximum8
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:14.957188image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3.125
Q13.625
median3.99
Q34.375
95-th percentile4.99
Maximum8
Range8
Interquartile range (IQR)0.75

Descriptive statistics

Standard deviation0.56139119
Coefficient of variation (CV)0.13877013
Kurtosis0.34456404
Mean4.0454758
Median Absolute Deviation (MAD)0.365
Skewness0.38840603
Sum454027.8
Variance0.31516007
MonotonicityNot monotonic
2025-12-28T18:59:15.093890image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.9914455
 
9.7%
3.6258800
 
5.9%
3.8758592
 
5.8%
3.758474
 
5.7%
3.56866
 
4.6%
4.56809
 
4.6%
4.3756482
 
4.4%
4.256045
 
4.1%
4.1255797
 
3.9%
4.754875
 
3.3%
Other values (121)35036
23.6%
(Missing)36439
24.5%
ValueCountFrequency (%)
01
 
< 0.1%
2.1251
 
< 0.1%
2.254
 
< 0.1%
2.3752
 
< 0.1%
2.4752
 
< 0.1%
2.521
< 0.1%
2.5751
 
< 0.1%
2.63
 
< 0.1%
2.62525
< 0.1%
2.652
 
< 0.1%
ValueCountFrequency (%)
81
 
< 0.1%
7.751
 
< 0.1%
7.52
 
< 0.1%
7.3751
 
< 0.1%
7.1251
 
< 0.1%
71
 
< 0.1%
6.8751
 
< 0.1%
6.755
< 0.1%
6.53
< 0.1%
6.3751
 
< 0.1%

Interest_rate_spread
Real number (ℝ)

High correlation  Missing 

Distinct22516
Distinct (%)20.1%
Missing36639
Missing (%)24.6%
Infinite0
Infinite (%)0.0%
Mean0.44165566
Minimum-3.638
Maximum3.357
Zeros9
Zeros (%)< 0.1%
Negative21883
Negative (%)14.7%
Memory size1.1 MiB
2025-12-28T18:59:15.233286image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum-3.638
5-th percentile-0.31745
Q10.076
median0.3904
Q30.7754
95-th percentile1.3794
Maximum3.357
Range6.995
Interquartile range (IQR)0.6994

Descriptive statistics

Standard deviation0.51304274
Coefficient of variation (CV)1.1616351
Kurtosis-0.18356608
Mean0.44165566
Median Absolute Deviation (MAD)0.3427
Skewness0.28076233
Sum49479.125
Variance0.26321285
MonotonicityNot monotonic
2025-12-28T18:59:15.369719image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-0.02877
 
0.1%
-0.03864
 
< 0.1%
-0.02360
 
< 0.1%
-0.17356
 
< 0.1%
-0.14852
 
< 0.1%
0.25251
 
< 0.1%
0.20251
 
< 0.1%
0.25746
 
< 0.1%
0.11246
 
< 0.1%
-0.01345
 
< 0.1%
Other values (22506)111483
75.0%
(Missing)36639
 
24.6%
ValueCountFrequency (%)
-3.6381
< 0.1%
-1.08411
< 0.1%
-1.0471
< 0.1%
-1.04621
< 0.1%
-1.0391
< 0.1%
-1.0381
< 0.1%
-1.03791
< 0.1%
-1.03431
< 0.1%
-1.02941
< 0.1%
-1.02881
< 0.1%
ValueCountFrequency (%)
3.3571
< 0.1%
2.88541
< 0.1%
2.72271
< 0.1%
2.63681
< 0.1%
2.59321
< 0.1%
2.58511
< 0.1%
2.5371
< 0.1%
2.51441
< 0.1%
2.40931
< 0.1%
2.1821
< 0.1%

Upfront_charges
Real number (ℝ)

High correlation  Missing  Zeros 

Distinct58271
Distinct (%)53.4%
Missing39642
Missing (%)26.7%
Infinite0
Infinite (%)0.0%
Mean3224.9961
Minimum0
Maximum60000
Zeros20770
Zeros (%)14.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:15.511429image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1581.49
median2596.45
Q34812.5
95-th percentile9272.6885
Maximum60000
Range60000
Interquartile range (IQR)4231.01

Descriptive statistics

Standard deviation3251.1215
Coefficient of variation (CV)1.0081009
Kurtosis6.3685863
Mean3224.9961
Median Absolute Deviation (MAD)2108.66
Skewness1.7540757
Sum3.5161488 × 108
Variance10569791
MonotonicityNot monotonic
2025-12-28T18:59:15.646909image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
020770
 
14.0%
12501184
 
0.8%
1150892
 
0.6%
795487
 
0.3%
295403
 
0.3%
950192
 
0.1%
3000173
 
0.1%
995151
 
0.1%
4000149
 
0.1%
5000147
 
0.1%
Other values (58261)84480
56.8%
(Missing)39642
26.7%
ValueCountFrequency (%)
020770
14.0%
0.031
 
< 0.1%
0.061
 
< 0.1%
0.351
 
< 0.1%
0.61
 
< 0.1%
0.721
 
< 0.1%
0.751
 
< 0.1%
0.921
 
< 0.1%
112
 
< 0.1%
1.151
 
< 0.1%
ValueCountFrequency (%)
600001
< 0.1%
53485.781
< 0.1%
38437.51
< 0.1%
383751
< 0.1%
37604.381
< 0.1%
35192.51
< 0.1%
332681
< 0.1%
328501
< 0.1%
32825.251
< 0.1%
326471
< 0.1%

term
Real number (ℝ)

Distinct26
Distinct (%)< 0.1%
Missing41
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean335.13658
Minimum96
Maximum360
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:15.767241image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum96
5-th percentile180
Q1360
median360
Q3360
95-th percentile360
Maximum360
Range264
Interquartile range (IQR)0

Descriptive statistics

Standard deviation58.409084
Coefficient of variation (CV)0.17428442
Kurtosis3.1732363
Mean335.13658
Median Absolute Deviation (MAD)0
Skewness-2.1748218
Sum49811015
Variance3411.621
MonotonicityNot monotonic
2025-12-28T18:59:15.879086image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
360121685
81.8%
18012981
 
8.7%
2405859
 
3.9%
3002822
 
1.9%
3242766
 
1.9%
120510
 
0.3%
144263
 
0.2%
348260
 
0.2%
336213
 
0.1%
96194
 
0.1%
Other values (16)1076
 
0.7%
ValueCountFrequency (%)
96194
 
0.1%
10833
 
< 0.1%
120510
 
0.3%
13293
 
0.1%
144263
 
0.2%
156174
 
0.1%
1651
 
< 0.1%
16882
 
0.1%
18012981
8.7%
19217
 
< 0.1%
ValueCountFrequency (%)
360121685
81.8%
348260
 
0.2%
336213
 
0.1%
3242766
 
1.9%
3221
 
< 0.1%
312185
 
0.1%
3002822
 
1.9%
28890
 
0.1%
2801
 
< 0.1%
276100
 
0.1%

Neg_ammortization
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing121
Missing (%)0.1%
Memory size9.1 MiB
not_neg
133420 
neg_amm
15129 

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters1039843
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownot_neg
2nd rownot_neg
3rd rowneg_amm
4th rownot_neg
5th rownot_neg

Common Values

ValueCountFrequency (%)
not_neg133420
89.7%
neg_amm15129
 
10.2%
(Missing)121
 
0.1%

Length

2025-12-28T18:59:15.997919image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:16.072968image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
not_neg133420
89.8%
neg_amm15129
 
10.2%

Most occurring characters

ValueCountFrequency (%)
n281969
27.1%
_148549
14.3%
e148549
14.3%
g148549
14.3%
o133420
12.8%
t133420
12.8%
m30258
 
2.9%
a15129
 
1.5%

Most occurring categories

ValueCountFrequency (%)
(unknown)1039843
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n281969
27.1%
_148549
14.3%
e148549
14.3%
g148549
14.3%
o133420
12.8%
t133420
12.8%
m30258
 
2.9%
a15129
 
1.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown)1039843
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n281969
27.1%
_148549
14.3%
e148549
14.3%
g148549
14.3%
o133420
12.8%
t133420
12.8%
m30258
 
2.9%
a15129
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown)1039843
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n281969
27.1%
_148549
14.3%
e148549
14.3%
g148549
14.3%
o133420
12.8%
t133420
12.8%
m30258
 
2.9%
a15129
 
1.5%

interest_only
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size9.1 MiB
not_int
141560 
int_only
 
7110

Length

Max length8
Median length7
Mean length7.047824
Min length7

Characters and Unicode

Total characters1047800
Distinct characters7
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownot_int
2nd rownot_int
3rd rownot_int
4th rownot_int
5th rownot_int

Common Values

ValueCountFrequency (%)
not_int141560
95.2%
int_only7110
 
4.8%

Length

2025-12-28T18:59:16.162802image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:16.234805image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
not_int141560
95.2%
int_only7110
 
4.8%

Most occurring characters

ValueCountFrequency (%)
n297340
28.4%
t290230
27.7%
o148670
14.2%
_148670
14.2%
i148670
14.2%
l7110
 
0.7%
y7110
 
0.7%

Most occurring categories

ValueCountFrequency (%)
(unknown)1047800
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n297340
28.4%
t290230
27.7%
o148670
14.2%
_148670
14.2%
i148670
14.2%
l7110
 
0.7%
y7110
 
0.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown)1047800
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n297340
28.4%
t290230
27.7%
o148670
14.2%
_148670
14.2%
i148670
14.2%
l7110
 
0.7%
y7110
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown)1047800
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n297340
28.4%
t290230
27.7%
o148670
14.2%
_148670
14.2%
i148670
14.2%
l7110
 
0.7%
y7110
 
0.7%

lump_sum_payment
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size9.2 MiB
not_lpsm
145286 
lpsm
 
3384

Length

Max length8
Median length8
Mean length7.9089527
Min length4

Characters and Unicode

Total characters1175824
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownot_lpsm
2nd rowlpsm
3rd rownot_lpsm
4th rownot_lpsm
5th rownot_lpsm

Common Values

ValueCountFrequency (%)
not_lpsm145286
97.7%
lpsm3384
 
2.3%

Length

2025-12-28T18:59:16.326216image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:16.404745image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
not_lpsm145286
97.7%
lpsm3384
 
2.3%

Most occurring characters

ValueCountFrequency (%)
l148670
12.6%
p148670
12.6%
s148670
12.6%
m148670
12.6%
n145286
12.4%
o145286
12.4%
t145286
12.4%
_145286
12.4%

Most occurring categories

ValueCountFrequency (%)
(unknown)1175824
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l148670
12.6%
p148670
12.6%
s148670
12.6%
m148670
12.6%
n145286
12.4%
o145286
12.4%
t145286
12.4%
_145286
12.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown)1175824
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l148670
12.6%
p148670
12.6%
s148670
12.6%
m148670
12.6%
n145286
12.4%
o145286
12.4%
t145286
12.4%
_145286
12.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown)1175824
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l148670
12.6%
p148670
12.6%
s148670
12.6%
m148670
12.6%
n145286
12.4%
o145286
12.4%
t145286
12.4%
_145286
12.4%

property_value
Real number (ℝ)

High correlation  Missing 

Distinct385
Distinct (%)0.3%
Missing15098
Missing (%)10.2%
Infinite0
Infinite (%)0.0%
Mean497893.47
Minimum8000
Maximum16508000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:16.498474image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum8000
5-th percentile148000
Q1268000
median418000
Q3628000
95-th percentile1058000
Maximum16508000
Range16500000
Interquartile range (IQR)360000

Descriptive statistics

Standard deviation359935.32
Coefficient of variation (CV)0.72291633
Kurtosis73.221196
Mean497893.47
Median Absolute Deviation (MAD)170000
Skewness4.5862758
Sum6.6504626 × 1010
Variance1.2955343 × 1011
MonotonicityNot monotonic
2025-12-28T18:59:16.637814image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3080002792
 
1.9%
2580002763
 
1.9%
3580002679
 
1.8%
4080002537
 
1.7%
3280002524
 
1.7%
2780002513
 
1.7%
2680002497
 
1.7%
2280002493
 
1.7%
2380002408
 
1.6%
2880002398
 
1.6%
Other values (375)107968
72.6%
(Missing)15098
 
10.2%
ValueCountFrequency (%)
80006
 
< 0.1%
180001
 
< 0.1%
280009
 
< 0.1%
3800035
 
< 0.1%
4800071
 
< 0.1%
58000141
 
0.1%
68000271
0.2%
78000387
0.3%
88000568
0.4%
98000556
0.4%
ValueCountFrequency (%)
165080001
< 0.1%
120080001
< 0.1%
110080001
< 0.1%
100080001
< 0.1%
92680001
< 0.1%
85080001
< 0.1%
76080001
< 0.1%
69080001
< 0.1%
65080001
< 0.1%
64080001
< 0.1%

construction_type
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.4 MiB
sb
148637 
mh
 
33

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters297340
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowsb
2nd rowsb
3rd rowsb
4th rowsb
5th rowsb

Common Values

ValueCountFrequency (%)
sb148637
> 99.9%
mh33
 
< 0.1%

Length

2025-12-28T18:59:16.885356image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:16.960918image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
sb148637
> 99.9%
mh33
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
s148637
50.0%
b148637
50.0%
m33
 
< 0.1%
h33
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
s148637
50.0%
b148637
50.0%
m33
 
< 0.1%
h33
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
s148637
50.0%
b148637
50.0%
m33
 
< 0.1%
h33
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
s148637
50.0%
b148637
50.0%
m33
 
< 0.1%
h33
 
< 0.1%

occupancy_type
Categorical

Imbalance 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.4 MiB
pr
138201 
ir
 
7340
sr
 
3129

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters297340
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowpr
2nd rowpr
3rd rowpr
4th rowpr
5th rowpr

Common Values

ValueCountFrequency (%)
pr138201
93.0%
ir7340
 
4.9%
sr3129
 
2.1%

Length

2025-12-28T18:59:17.049585image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:17.131320image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
pr138201
93.0%
ir7340
 
4.9%
sr3129
 
2.1%

Most occurring characters

ValueCountFrequency (%)
r148670
50.0%
p138201
46.5%
i7340
 
2.5%
s3129
 
1.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
r148670
50.0%
p138201
46.5%
i7340
 
2.5%
s3129
 
1.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
r148670
50.0%
p138201
46.5%
i7340
 
2.5%
s3129
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
r148670
50.0%
p138201
46.5%
i7340
 
2.5%
s3129
 
1.1%

Secured_by
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.6 MiB
home
148637 
land
 
33

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters594680
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhome
2nd rowhome
3rd rowhome
4th rowhome
5th rowhome

Common Values

ValueCountFrequency (%)
home148637
> 99.9%
land33
 
< 0.1%

Length

2025-12-28T18:59:17.226708image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:17.304913image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
home148637
> 99.9%
land33
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
h148637
25.0%
o148637
25.0%
m148637
25.0%
e148637
25.0%
l33
 
< 0.1%
a33
 
< 0.1%
n33
 
< 0.1%
d33
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)594680
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
h148637
25.0%
o148637
25.0%
m148637
25.0%
e148637
25.0%
l33
 
< 0.1%
a33
 
< 0.1%
n33
 
< 0.1%
d33
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)594680
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
h148637
25.0%
o148637
25.0%
m148637
25.0%
e148637
25.0%
l33
 
< 0.1%
a33
 
< 0.1%
n33
 
< 0.1%
d33
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)594680
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
h148637
25.0%
o148637
25.0%
m148637
25.0%
e148637
25.0%
l33
 
< 0.1%
a33
 
< 0.1%
n33
 
< 0.1%
d33
 
< 0.1%

total_units
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.4 MiB
1U
146480 
2U
 
1477
3U
 
393
4U
 
320

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters297340
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1U
2nd row1U
3rd row1U
4th row1U
5th row1U

Common Values

ValueCountFrequency (%)
1U146480
98.5%
2U1477
 
1.0%
3U393
 
0.3%
4U320
 
0.2%

Length

2025-12-28T18:59:17.395751image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:17.488504image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
1u146480
98.5%
2u1477
 
1.0%
3u393
 
0.3%
4u320
 
0.2%

Most occurring characters

ValueCountFrequency (%)
U148670
50.0%
1146480
49.3%
21477
 
0.5%
3393
 
0.1%
4320
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
U148670
50.0%
1146480
49.3%
21477
 
0.5%
3393
 
0.1%
4320
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
U148670
50.0%
1146480
49.3%
21477
 
0.5%
3393
 
0.1%
4320
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)297340
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
U148670
50.0%
1146480
49.3%
21477
 
0.5%
3393
 
0.1%
4320
 
0.1%

income
Real number (ℝ)

High correlation  Missing 

Distinct1001
Distinct (%)0.7%
Missing9150
Missing (%)6.2%
Infinite0
Infinite (%)0.0%
Mean6957.3389
Minimum0
Maximum578580
Zeros1260
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:17.592515image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1920
Q13720
median5760
Q38520
95-th percentile15420
Maximum578580
Range578580
Interquartile range (IQR)4800

Descriptive statistics

Standard deviation6496.5864
Coefficient of variation (CV)0.93377461
Kurtosis885.29246
Mean6957.3389
Median Absolute Deviation (MAD)2280
Skewness17.307695
Sum9.7068792 × 108
Variance42205635
MonotonicityNot monotonic
2025-12-28T18:59:17.727856image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01260
 
0.8%
36001250
 
0.8%
42001243
 
0.8%
48001191
 
0.8%
31201168
 
0.8%
37201161
 
0.8%
39001159
 
0.8%
54001152
 
0.8%
33001144
 
0.8%
45001139
 
0.8%
Other values (991)127653
85.9%
(Missing)9150
 
6.2%
ValueCountFrequency (%)
01260
0.8%
605
 
< 0.1%
12012
 
< 0.1%
18012
 
< 0.1%
24015
 
< 0.1%
30018
 
< 0.1%
36011
 
< 0.1%
42015
 
< 0.1%
48011
 
< 0.1%
54017
 
< 0.1%
ValueCountFrequency (%)
5785801
< 0.1%
3772201
< 0.1%
3744001
< 0.1%
3358802
< 0.1%
3294601
< 0.1%
3228601
< 0.1%
3120001
< 0.1%
2400001
< 0.1%
2359801
< 0.1%
1980601
< 0.1%

credit_type
Categorical

High correlation 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.6 MiB
CIB
48152 
CRIF
43901 
EXP
41319 
EQUI
15298 

Length

Max length4
Median length3
Mean length3.3981906
Min length3

Characters and Unicode

Total characters505209
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEXP
2nd rowEQUI
3rd rowEXP
4th rowEXP
5th rowCRIF

Common Values

ValueCountFrequency (%)
CIB48152
32.4%
CRIF43901
29.5%
EXP41319
27.8%
EQUI15298
 
10.3%

Length

2025-12-28T18:59:17.855280image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:17.931650image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
cib48152
32.4%
crif43901
29.5%
exp41319
27.8%
equi15298
 
10.3%

Most occurring characters

ValueCountFrequency (%)
I107351
21.2%
C92053
18.2%
E56617
11.2%
B48152
9.5%
R43901
8.7%
F43901
8.7%
X41319
 
8.2%
P41319
 
8.2%
Q15298
 
3.0%
U15298
 
3.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)505209
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
I107351
21.2%
C92053
18.2%
E56617
11.2%
B48152
9.5%
R43901
8.7%
F43901
8.7%
X41319
 
8.2%
P41319
 
8.2%
Q15298
 
3.0%
U15298
 
3.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)505209
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
I107351
21.2%
C92053
18.2%
E56617
11.2%
B48152
9.5%
R43901
8.7%
F43901
8.7%
X41319
 
8.2%
P41319
 
8.2%
Q15298
 
3.0%
U15298
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)505209
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
I107351
21.2%
C92053
18.2%
E56617
11.2%
B48152
9.5%
R43901
8.7%
F43901
8.7%
X41319
 
8.2%
P41319
 
8.2%
Q15298
 
3.0%
U15298
 
3.0%

Credit_Score
Real number (ℝ)

Distinct401
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean699.7891
Minimum500
Maximum900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:18.047090image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum500
5-th percentile519
Q1599
median699
Q3800
95-th percentile881
Maximum900
Range400
Interquartile range (IQR)201

Descriptive statistics

Standard deviation115.87586
Coefficient of variation (CV)0.16558683
Kurtosis-1.2026494
Mean699.7891
Median Absolute Deviation (MAD)100
Skewness0.004766757
Sum1.0403765 × 108
Variance13427.214
MonotonicityNot monotonic
2025-12-28T18:59:18.191126image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
763415
 
0.3%
867413
 
0.3%
639411
 
0.3%
581408
 
0.3%
554407
 
0.3%
519406
 
0.3%
737406
 
0.3%
890406
 
0.3%
687405
 
0.3%
617405
 
0.3%
Other values (391)144588
97.3%
ValueCountFrequency (%)
500357
0.2%
501357
0.2%
502346
0.2%
503383
0.3%
504392
0.3%
505379
0.3%
506380
0.3%
507386
0.3%
508400
0.3%
509348
0.2%
ValueCountFrequency (%)
900393
0.3%
899352
0.2%
898370
0.2%
897383
0.3%
896391
0.3%
895371
0.2%
894361
0.2%
893348
0.2%
892366
0.2%
891376
0.3%

co-applicant_credit_type
Categorical

High correlation 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.5 MiB
CIB
74392 
EXP
74278 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters446010
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCIB
2nd rowEXP
3rd rowCIB
4th rowCIB
5th rowEXP

Common Values

ValueCountFrequency (%)
CIB74392
50.0%
EXP74278
50.0%

Length

2025-12-28T18:59:18.332772image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:18.404479image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
cib74392
50.0%
exp74278
50.0%

Most occurring characters

ValueCountFrequency (%)
C74392
16.7%
I74392
16.7%
B74392
16.7%
E74278
16.7%
X74278
16.7%
P74278
16.7%

Most occurring categories

ValueCountFrequency (%)
(unknown)446010
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
C74392
16.7%
I74392
16.7%
B74392
16.7%
E74278
16.7%
X74278
16.7%
P74278
16.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown)446010
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
C74392
16.7%
I74392
16.7%
B74392
16.7%
E74278
16.7%
X74278
16.7%
P74278
16.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown)446010
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
C74392
16.7%
I74392
16.7%
B74392
16.7%
E74278
16.7%
X74278
16.7%
P74278
16.7%

age
Categorical

Distinct7
Distinct (%)< 0.1%
Missing200
Missing (%)0.1%
Memory size8.8 MiB
45-54
34720 
35-44
32818 
55-64
32534 
65-74
20744 
25-34
19142 
Other values (2)
8512 

Length

Max length5
Median length5
Mean length4.8853371
Min length3

Characters and Unicode

Total characters725326
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row25-34
2nd row55-64
3rd row35-44
4th row45-54
5th row25-34

Common Values

ValueCountFrequency (%)
45-5434720
23.4%
35-4432818
22.1%
55-6432534
21.9%
65-7420744
14.0%
25-3419142
12.9%
>747175
 
4.8%
<251337
 
0.9%
(Missing)200
 
0.1%

Length

2025-12-28T18:59:18.509612image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:18.611941image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
45-5434720
23.4%
35-4432818
22.1%
55-6432534
21.9%
65-7420744
14.0%
25-3419142
12.9%
747175
 
4.8%
251337
 
0.9%

Most occurring characters

ValueCountFrequency (%)
4214671
29.6%
5208549
28.8%
-139958
19.3%
653278
 
7.3%
351960
 
7.2%
727919
 
3.8%
220479
 
2.8%
>7175
 
1.0%
<1337
 
0.2%

Most occurring categories

ValueCountFrequency (%)
(unknown)725326
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
4214671
29.6%
5208549
28.8%
-139958
19.3%
653278
 
7.3%
351960
 
7.2%
727919
 
3.8%
220479
 
2.8%
>7175
 
1.0%
<1337
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown)725326
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
4214671
29.6%
5208549
28.8%
-139958
19.3%
653278
 
7.3%
351960
 
7.2%
727919
 
3.8%
220479
 
2.8%
>7175
 
1.0%
<1337
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown)725326
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
4214671
29.6%
5208549
28.8%
-139958
19.3%
653278
 
7.3%
351960
 
7.2%
727919
 
3.8%
220479
 
2.8%
>7175
 
1.0%
<1337
 
0.2%
Distinct2
Distinct (%)< 0.1%
Missing200
Missing (%)0.1%
Memory size9.1 MiB
to_inst
95814 
not_inst
52656 

Length

Max length8
Median length7
Mean length7.3546575
Min length7

Characters and Unicode

Total characters1091946
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowto_inst
2nd rowto_inst
3rd rowto_inst
4th rownot_inst
5th rownot_inst

Common Values

ValueCountFrequency (%)
to_inst95814
64.4%
not_inst52656
35.4%
(Missing)200
 
0.1%

Length

2025-12-28T18:59:18.734652image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:18.803037image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
to_inst95814
64.5%
not_inst52656
35.5%

Most occurring characters

ValueCountFrequency (%)
t296940
27.2%
n201126
18.4%
o148470
13.6%
_148470
13.6%
i148470
13.6%
s148470
13.6%

Most occurring categories

ValueCountFrequency (%)
(unknown)1091946
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t296940
27.2%
n201126
18.4%
o148470
13.6%
_148470
13.6%
i148470
13.6%
s148470
13.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown)1091946
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t296940
27.2%
n201126
18.4%
o148470
13.6%
_148470
13.6%
i148470
13.6%
s148470
13.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown)1091946
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t296940
27.2%
n201126
18.4%
o148470
13.6%
_148470
13.6%
i148470
13.6%
s148470
13.6%

LTV
Real number (ℝ)

Missing  Skewed 

Distinct8484
Distinct (%)6.4%
Missing15098
Missing (%)10.2%
Infinite0
Infinite (%)0.0%
Mean72.746457
Minimum0.9674782
Maximum7831.25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:18.904220image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0.9674782
5-th percentile36.350575
Q160.47486
median75.13587
Q386.184211
95-th percentile98.728814
Maximum7831.25
Range7830.2825
Interquartile range (IQR)25.70935

Descriptive statistics

Standard deviation39.967603
Coefficient of variation (CV)0.54940961
Kurtosis19979.045
Mean72.746457
Median Absolute Deviation (MAD)12.514733
Skewness120.61534
Sum9716889.8
Variance1597.4093
MonotonicityNot monotonic
2025-12-28T18:59:19.048238image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
81.25530
 
0.4%
91.66666667499
 
0.3%
80.03875969380
 
0.3%
80.03246753328
 
0.2%
94.95614035322
 
0.2%
78.84615385317
 
0.2%
78.64583333310
 
0.2%
79.04040404309
 
0.2%
80.06329114309
 
0.2%
95.16806723306
 
0.2%
Other values (8474)129962
87.4%
(Missing)15098
 
10.2%
ValueCountFrequency (%)
0.9674781981
< 0.1%
2.0729426431
< 0.1%
2.7675873971
< 0.1%
2.813745021
< 0.1%
2.8564206271
< 0.1%
2.9925847461
< 0.1%
3.0835543771
< 0.1%
3.1251
< 0.1%
3.746684351
< 0.1%
3.8751714681
< 0.1%
ValueCountFrequency (%)
7831.251
< 0.1%
6706.251
< 0.1%
5206.251
< 0.1%
4706.251
< 0.1%
2956.251
< 0.1%
2331.251
< 0.1%
263.54166671
< 0.1%
237.52
< 0.1%
220.36290321
< 0.1%
201.78571431
< 0.1%

Region
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.8 MiB
North
74722 
south
64016 
central
8697 
North-East
 
1235

Length

Max length10
Median length5
Mean length5.1585323
Min length5

Characters and Unicode

Total characters766919
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowsouth
2nd rowNorth
3rd rowsouth
4th rowNorth
5th rowNorth

Common Values

ValueCountFrequency (%)
North74722
50.3%
south64016
43.1%
central8697
 
5.8%
North-East1235
 
0.8%

Length

2025-12-28T18:59:19.180369image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:19.281110image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
north74722
50.3%
south64016
43.1%
central8697
 
5.8%
north-east1235
 
0.8%

Most occurring characters

ValueCountFrequency (%)
t149905
19.5%
o139973
18.3%
h139973
18.3%
r84654
11.0%
N75957
9.9%
s65251
8.5%
u64016
8.3%
a9932
 
1.3%
c8697
 
1.1%
e8697
 
1.1%
Other values (4)19864
 
2.6%

Most occurring categories

ValueCountFrequency (%)
(unknown)766919
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t149905
19.5%
o139973
18.3%
h139973
18.3%
r84654
11.0%
N75957
9.9%
s65251
8.5%
u64016
8.3%
a9932
 
1.3%
c8697
 
1.1%
e8697
 
1.1%
Other values (4)19864
 
2.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown)766919
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t149905
19.5%
o139973
18.3%
h139973
18.3%
r84654
11.0%
N75957
9.9%
s65251
8.5%
u64016
8.3%
a9932
 
1.3%
c8697
 
1.1%
e8697
 
1.1%
Other values (4)19864
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown)766919
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t149905
19.5%
o139973
18.3%
h139973
18.3%
r84654
11.0%
N75957
9.9%
s65251
8.5%
u64016
8.3%
a9932
 
1.3%
c8697
 
1.1%
e8697
 
1.1%
Other values (4)19864
 
2.6%

Security_Type
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.9 MiB
direct
148637 
Indriect
 
33

Length

Max length8
Median length6
Mean length6.0004439
Min length6

Characters and Unicode

Total characters892086
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowdirect
2nd rowdirect
3rd rowdirect
4th rowdirect
5th rowdirect

Common Values

ValueCountFrequency (%)
direct148637
> 99.9%
Indriect33
 
< 0.1%

Length

2025-12-28T18:59:19.479109image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:19.607111image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
direct148637
> 99.9%
indriect33
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
d148670
16.7%
i148670
16.7%
r148670
16.7%
e148670
16.7%
c148670
16.7%
t148670
16.7%
I33
 
< 0.1%
n33
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown)892086
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
d148670
16.7%
i148670
16.7%
r148670
16.7%
e148670
16.7%
c148670
16.7%
t148670
16.7%
I33
 
< 0.1%
n33
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown)892086
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
d148670
16.7%
i148670
16.7%
r148670
16.7%
e148670
16.7%
c148670
16.7%
t148670
16.7%
I33
 
< 0.1%
n33
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown)892086
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
d148670
16.7%
i148670
16.7%
r148670
16.7%
e148670
16.7%
c148670
16.7%
t148670
16.7%
I33
 
< 0.1%
n33
 
< 0.1%

Status
Categorical

High correlation 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.2 MiB
0
112031 
1
36639 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters148670
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0112031
75.4%
136639
 
24.6%

Length

2025-12-28T18:59:19.719112image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-12-28T18:59:19.808113image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
0112031
75.4%
136639
 
24.6%

Most occurring characters

ValueCountFrequency (%)
0112031
75.4%
136639
 
24.6%

Most occurring categories

ValueCountFrequency (%)
(unknown)148670
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0112031
75.4%
136639
 
24.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown)148670
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0112031
75.4%
136639
 
24.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown)148670
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0112031
75.4%
136639
 
24.6%

dtir1
Real number (ℝ)

Missing 

Distinct57
Distinct (%)< 0.1%
Missing24121
Missing (%)16.2%
Infinite0
Infinite (%)0.0%
Mean37.732932
Minimum5
Maximum61
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 MiB
2025-12-28T18:59:19.908114image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile20
Q131
median39
Q345
95-th percentile54
Maximum61
Range56
Interquartile range (IQR)14

Descriptive statistics

Standard deviation10.545435
Coefficient of variation (CV)0.27947563
Kurtosis0.37888256
Mean37.732932
Median Absolute Deviation (MAD)7
Skewness-0.55146496
Sum4699599
Variance111.2062
MonotonicityNot monotonic
2025-12-28T18:59:20.066105image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
376848
 
4.6%
366553
 
4.4%
446500
 
4.4%
496309
 
4.2%
435307
 
3.6%
425121
 
3.4%
414881
 
3.3%
404699
 
3.2%
394540
 
3.1%
384461
 
3.0%
Other values (47)69330
46.6%
(Missing)24121
 
16.2%
ValueCountFrequency (%)
5386
0.3%
6420
0.3%
7379
0.3%
8433
0.3%
9395
0.3%
10386
0.3%
11400
0.3%
12383
0.3%
13421
0.3%
14393
0.3%
ValueCountFrequency (%)
61692
0.5%
60832
0.6%
59812
0.5%
58757
0.5%
57823
0.6%
56746
0.5%
55798
0.5%
54832
0.6%
53787
0.5%
52777
0.5%

Interactions

2025-12-28T18:59:08.376975image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:52.689828image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.365266image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.999667image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.498860image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.937945image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.553823image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.037253image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.571748image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.175591image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.781034image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:08.520591image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:52.850303image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.510879image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.136082image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.627056image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.083100image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.698861image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.180864image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.710750image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.318094image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.960525image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:08.654143image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:52.985240image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.652053image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.273743image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.761347image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.205668image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.831614image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.316899image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.845829image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.471696image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.132521image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:08.785912image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:53.127456image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.788606image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.409785image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.898476image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.343189image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.955502image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.462199image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.980930image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.613893image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.266644image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:08.914137image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:53.266739image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.923088image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.546609image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.019018image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.469248image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.084893image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.586189image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:04.111877image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.761085image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.401637image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:09.155192image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:53.428903image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.066129image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.675930image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.143986image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.611582image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.226866image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.732798image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:04.260972image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.912102image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.553713image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:09.279016image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:53.567391image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.195987image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.804596image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.272465image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.837435image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.346546image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.863762image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:04.492798image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.048358image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.688391image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:09.416121image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:53.716717image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.339231image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:56.931441image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.395867image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:59.976474image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.489552image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:02.997878image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:04.641779image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.199817image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.829345image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:09.555704image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:53.865849image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.580592image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.075208image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.533512image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.130299image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.628889image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.147544image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:04.776857image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.346096image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:07.964341image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:09.687048image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.012002image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.716153image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.217296image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.678900image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.271495image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.765557image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.285556image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:04.905968image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.489538image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:08.113746image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:09.816432image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:54.165260image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:55.865808image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:57.352180image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:58:58.812906image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:00.411924image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:01.904264image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:03.427563image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:05.034999image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:06.638552image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-12-28T18:59:08.247750image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-12-28T18:59:20.321256image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Credit_ScoreCredit_WorthinessGenderIDInterest_rate_spreadLTVNeg_ammortizationRegionSecured_bySecurity_TypeStatusUpfront_chargesageapprov_in_advbusiness_or_commercialco-applicant_credit_typeconstruction_typecredit_typedtir1incomeinterest_onlyloan_amountloan_limitloan_purposeloan_typelump_sum_paymentoccupancy_typeopen_creditproperty_valuerate_of_interestsubmission_of_applicationtermtotal_units
Credit_Score1.0000.0060.000-0.001-0.003-0.0040.0020.0000.0070.0070.005-0.0020.0040.0000.0000.0000.0070.003-0.0000.0010.0000.0050.0000.0070.0000.0000.0030.0000.005-0.0020.006-0.0050.003
Credit_Worthiness0.0061.0000.0040.0060.0560.0120.0590.0050.0000.0000.0350.0000.0100.0620.0010.0130.0000.0230.0150.0130.0500.0140.0240.0500.0130.0310.0070.2300.0380.1650.0220.2200.008
Gender0.0000.0041.0000.0000.0720.0020.0170.3760.0030.0030.0840.0460.0710.0120.0480.6650.0030.0230.0380.0090.0120.1130.0390.0690.0800.0120.0200.0220.0230.0390.3600.0450.013
ID-0.0010.0060.0001.0000.003-0.0050.0000.0000.0050.0050.004-0.0050.0000.0000.0000.0070.0050.005-0.0070.0040.006-0.0010.0000.0000.0020.0000.0000.0080.001-0.0000.002-0.0020.004
Interest_rate_spread-0.0030.0560.0720.0031.0000.1110.0370.0221.0001.0001.0000.1000.0340.0560.3850.0681.0000.0260.075-0.2510.013-0.4260.0550.1890.4120.0100.1280.058-0.4520.5830.299-0.1660.027
LTV-0.0040.0120.002-0.0050.1111.0000.0000.0000.0000.0000.003-0.1150.0000.0030.0160.0040.0000.0000.156-0.0230.0000.0830.0080.0000.0100.0000.0000.000-0.3650.0300.0060.2100.000
Neg_ammortization0.0020.0590.0170.0000.0370.0001.0000.0070.0060.0060.1560.0250.0150.0780.0140.0100.0060.0790.0310.0080.0180.0290.0080.0730.0190.0510.0310.0000.0100.1770.0510.1990.014
Region0.0000.0050.3760.0000.0220.0000.0071.0000.0040.0040.0500.0300.0240.0080.0540.0300.0040.0150.0260.0000.0000.0050.0070.0470.0510.0130.0320.0090.0080.0120.1450.0280.006
Secured_by0.0070.0000.0030.0051.0000.0000.0060.0041.0000.9850.0251.0000.0040.0000.0060.0050.9850.0030.0000.0000.0000.0000.0020.0040.0070.0050.0000.0000.0001.0000.0000.0000.000
Security_Type0.0070.0000.0030.0051.0000.0000.0060.0040.9851.0000.0251.0000.0040.0000.0060.0050.9850.0030.0000.0000.0000.0000.0020.0040.0070.0050.0000.0000.0001.0000.0000.0000.000
Status0.0050.0350.0840.0041.0000.0030.1560.0500.0250.0251.0000.0100.0500.0370.0920.1440.0250.5920.2240.0090.0140.0850.0540.0400.0940.1880.0300.0100.0250.0250.1210.1020.029
Upfront_charges-0.0020.0000.046-0.0050.100-0.1150.0250.0301.0001.0000.0101.0000.0290.0050.1070.0341.0000.009-0.017-0.0800.000-0.1160.0750.0870.0840.0000.0261.000-0.075-0.0520.165-0.1160.037
age0.0040.0100.0710.0000.0340.0000.0150.0240.0040.0040.0500.0291.0000.0320.0790.0490.0040.0200.0330.0070.0070.0810.0320.1910.1080.0110.0600.0400.0240.0300.2700.0560.007
approv_in_adv0.0000.0620.0120.0000.0560.0030.0780.0080.0000.0000.0370.0050.0321.0000.0100.0120.0000.0180.0170.0000.0740.0290.0960.1540.0130.0610.0200.0050.0160.0830.0810.0470.004
business_or_commercial0.0000.0010.0480.0000.3850.0160.0140.0540.0060.0060.0920.1070.0790.0101.0000.0230.0060.0280.3030.0120.0070.1420.0220.0651.0000.0140.1110.0240.0440.0500.0900.1020.026
co-applicant_credit_type0.0000.0130.6650.0070.0680.0040.0100.0300.0050.0050.1440.0340.0490.0120.0231.0000.0050.3390.0530.0080.0140.1300.0410.0450.0500.0340.0260.0160.0330.0440.0620.0180.000
construction_type0.0070.0000.0030.0051.0000.0000.0060.0040.9850.9850.0251.0000.0040.0000.0060.0051.0000.0030.0000.0000.0000.0000.0020.0040.0070.0050.0000.0000.0001.0000.0000.0000.000
credit_type0.0030.0230.0230.0050.0260.0000.0790.0150.0030.0030.5920.0090.0200.0180.0280.3390.0031.0000.0040.0000.0140.0190.0320.0390.0490.1210.0080.0090.0060.0270.0460.0340.000
dtir1-0.0000.0150.038-0.0070.0750.1560.0310.0260.0000.0000.224-0.0170.0330.0170.3030.0530.0000.0041.000-0.3070.0040.0210.0340.0820.2620.0410.0390.013-0.0480.0620.0560.1080.014
income0.0010.0130.0090.004-0.251-0.0230.0080.0000.0000.0000.009-0.0800.0070.0000.0120.0080.0000.000-0.3071.0000.0000.6420.0470.0140.0100.0000.0440.0100.606-0.0900.020-0.0410.017
interest_only0.0000.0500.0120.0060.0130.0000.0180.0000.0000.0000.0140.0000.0070.0740.0070.0140.0000.0140.0040.0001.0000.0000.0310.0220.0110.0330.0110.2730.0400.1340.0100.0240.003
loan_amount0.0050.0140.113-0.001-0.4260.0830.0290.0050.0000.0000.085-0.1160.0810.0290.1420.1300.0000.0190.0210.6420.0001.0000.4560.1020.1050.0070.0330.0300.857-0.1720.4110.1960.079
loan_limit0.0000.0240.0390.0000.0550.0080.0080.0070.0020.0020.0540.0750.0320.0960.0220.0410.0020.0320.0340.0470.0310.4561.0000.0410.0630.0190.0150.0180.1500.0530.0110.0750.008
loan_purpose0.0070.0500.0690.0000.1890.0000.0730.0470.0040.0040.0400.0870.1910.1540.0650.0450.0040.0390.0820.0140.0220.1020.0411.0000.0660.0160.1320.0830.0290.1920.2650.1070.017
loan_type0.0000.0130.0800.0020.4120.0100.0190.0510.0070.0070.0940.0840.1080.0131.0000.0500.0070.0490.2620.0100.0110.1050.0630.0661.0000.0140.1090.0340.0390.2200.1100.1050.028
lump_sum_payment0.0000.0310.0120.0000.0100.0000.0510.0130.0050.0050.1880.0000.0110.0610.0140.0340.0050.1210.0410.0000.0330.0070.0190.0160.0141.0000.0000.0040.0000.0120.0130.0130.000
occupancy_type0.0030.0070.0200.0000.1280.0000.0310.0320.0000.0000.0300.0260.0600.0200.1110.0260.0000.0080.0390.0440.0110.0330.0150.1320.1090.0001.0000.0140.0000.1910.0670.0240.169
open_credit0.0000.2300.0220.0080.0580.0000.0000.0090.0000.0000.0101.0000.0400.0050.0240.0160.0000.0090.0130.0100.2730.0300.0180.0830.0340.0040.0141.0000.1370.4590.0450.0250.005
property_value0.0050.0380.0230.001-0.452-0.3650.0100.0080.0000.0000.025-0.0750.0240.0160.0440.0330.0000.006-0.0480.6060.0400.8570.1500.0290.0390.0000.0000.1371.000-0.1790.0390.0880.034
rate_of_interest-0.0020.1650.039-0.0000.5830.0300.1770.0121.0001.0000.025-0.0520.0300.0830.0500.0441.0000.0270.062-0.0900.134-0.1720.0530.1920.2200.0120.1910.459-0.1791.0000.1240.1910.058
submission_of_application0.0060.0220.3600.0020.2990.0060.0510.1450.0000.0000.1210.1650.2700.0810.0900.0620.0000.0460.0560.0200.0100.4110.0110.2650.1100.0130.0670.0450.0390.1241.0000.1730.070
term-0.0050.2200.045-0.002-0.1660.2100.1990.0280.0000.0000.102-0.1160.0560.0470.1020.0180.0000.0340.108-0.0410.0240.1960.0750.1070.1050.0130.0240.0250.0880.1910.1731.0000.013
total_units0.0030.0080.0130.0040.0270.0000.0140.0060.0000.0000.0290.0370.0070.0040.0260.0000.0000.0000.0140.0170.0030.0790.0080.0170.0280.0000.1690.0050.0340.0580.0700.0131.000

Missing values

2025-12-28T18:59:10.155962image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-12-28T18:59:10.805099image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2025-12-28T18:59:12.016829image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

IDyearloan_limitGenderapprov_in_advloan_typeloan_purposeCredit_Worthinessopen_creditbusiness_or_commercialloan_amountrate_of_interestInterest_rate_spreadUpfront_chargestermNeg_ammortizationinterest_onlylump_sum_paymentproperty_valueconstruction_typeoccupancy_typeSecured_bytotal_unitsincomecredit_typeCredit_Scoreco-applicant_credit_typeagesubmission_of_applicationLTVRegionSecurity_TypeStatusdtir1
0248902019cfSex Not Availablenopretype1p1l1nopcnob/c116500NaNNaNNaN360.0not_negnot_intnot_lpsm118000.0sbprhome1U1740.0EXP758CIB25-34to_inst98.728814southdirect145.0
1248912019cfMalenopretype2p1l1nopcb/c206500NaNNaNNaN360.0not_negnot_intlpsmNaNsbprhome1U4980.0EQUI552EXP55-64to_instNaNNorthdirect1NaN
2248922019cfMalepretype1p1l1nopcnob/c4065004.5600.2000595.00360.0neg_ammnot_intnot_lpsm508000.0sbprhome1U9480.0EXP834CIB35-44to_inst80.019685southdirect046.0
3248932019cfMalenopretype1p4l1nopcnob/c4565004.2500.6810NaN360.0not_negnot_intnot_lpsm658000.0sbprhome1U11880.0EXP587CIB45-54not_inst69.376900Northdirect042.0
4248942019cfJointpretype1p1l1nopcnob/c6965004.0000.30420.00360.0not_negnot_intnot_lpsm758000.0sbprhome1U10440.0CRIF602EXP25-34not_inst91.886544Northdirect039.0
5248952019cfJointpretype1p1l1nopcnob/c7065003.9900.1523370.00360.0not_negnot_intnot_lpsm1008000.0sbprhome1U10080.0EXP864EXP35-44not_inst70.089286Northdirect040.0
6248962019cfJointpretype1p3l1nopcnob/c3465004.5000.99985120.00360.0not_negnot_intnot_lpsm438000.0sbprhome1U5040.0EXP860EXP55-64to_inst79.109589Northdirect044.0
7248972019NaNFemalenopretype1p4l1nopcnob/c2665004.1250.29755609.88360.0not_negnot_intnot_lpsm308000.0sbprhome1U3780.0CIB863CIB55-64to_inst86.525974Northdirect042.0
8248982019cfJointnopretype1p3l1nopcnob/c3765004.8750.73951150.00360.0not_negnot_intnot_lpsm478000.0sbprhome1U5580.0CIB580EXP55-64to_inst78.765690centraldirect044.0
9248992019cfSex Not Availablenopretype3p3l1nopcnob/c4365003.490-0.27762316.50360.0not_negnot_intnot_lpsm688000.0sbprhome1U6720.0CIB788EXP55-64to_inst63.444767southdirect030.0
IDyearloan_limitGenderapprov_in_advloan_typeloan_purposeCredit_Worthinessopen_creditbusiness_or_commercialloan_amountrate_of_interestInterest_rate_spreadUpfront_chargestermNeg_ammortizationinterest_onlylump_sum_paymentproperty_valueconstruction_typeoccupancy_typeSecured_bytotal_unitsincomecredit_typeCredit_Scoreco-applicant_credit_typeagesubmission_of_applicationLTVRegionSecurity_TypeStatusdtir1
1486601735502019cfFemalenopretype1p4l1nopcnob/c3665003.875-0.11713643.16360.0not_negnot_intnot_lpsm658000.0sbprhome1U7200.0CIB851EXP45-54not_inst55.699088Northdirect020.0
1486611735512019cfSex Not Availablenopretype2p4l1nopcb/c346500NaNNaNNaN360.0not_negnot_intnot_lpsm358000.0sbprhome1UNaNEXP585CIB25-34to_inst96.787710southdirect1NaN
1486621735522019cfJointnopretype1p4l1nopcnob/c6465003.6250.07437639.80360.0not_negint_onlynot_lpsm828000.0sbprhome1U13500.0CIB873EXP45-54not_inst78.079710Northdirect031.0
1486631735532019cfMalenopretype2p1l1nopcb/c106500NaNNaNNaN360.0not_negnot_intnot_lpsmNaNsbprhome1U1860.0EQUI619EXP<25to_instNaNNorthdirect1NaN
1486641735542019cfJointnopretype2p1l1nopcb/c1565003.9901.40153113.06360.0not_negnot_intnot_lpsm158000.0sbprhome1U4020.0EXP859EXP65-74to_inst99.050633centraldirect045.0
1486651735552019cfSex Not Availablenopretype1p3l1nopcnob/c4365003.1250.25719960.00180.0not_negnot_intnot_lpsm608000.0sbprhome1U7860.0CIB659EXP55-64to_inst71.792763southdirect048.0
1486661735562019cfMalenopretype1p1l1nopcnob/c5865005.1900.85440.00360.0not_negnot_intnot_lpsm788000.0sbirhome4U7140.0CIB569CIB25-34not_inst74.428934southdirect015.0
1486671735572019cfMalenopretype1p4l1nopcnob/c4465003.1250.08161226.64180.0not_negnot_intnot_lpsm728000.0sbprhome1U6900.0CIB702EXP45-54not_inst61.332418Northdirect049.0
1486681735582019cfFemalenopretype1p4l1nopcnob/c1965003.5000.58244323.33180.0not_negnot_intnot_lpsm278000.0sbprhome1U7140.0EXP737EXP55-64to_inst70.683453Northdirect029.0
1486691735592019cfFemalenopretype1p3l1nopcnob/c4065004.3751.38716000.00240.0not_negnot_intnot_lpsm558000.0sbprhome1U7260.0CIB830CIB45-54not_inst72.849462Northdirect044.0